The objective of this article is to create and train an artificial neural network on a data set containing various climatic parameters, with the future fire area as the output parameter that the authors intend to predict. Such a data set is usually available for research and study. Before training the neural network model, the data set is divided into two samples: a training sample, which is about 90% of the set, and a sample for testing the trained model. In setting the task, the authors select and analyze the known data on the fires that occurred in Montesinho Park and compare the models trained on these data with and without normalization. As a result, two examples are given of graphs of the change in the absolute error of the fire areas predicted with the created and trained model.

Keywords: burning area, machine learning, model, neural networks, Keras, forecasting, forest fire.


Introduction. A forest fire is a natural and uncontrolled spread of fire over forest areas. According to the Federal Forestry Agency, in the week from June 3 to June 9, 2019, forest fire forces and contractors in 45 regions of Russia extinguished 354 forest fires over an area of 5783.2 hectares, including 98 fires over an area of 1790.05 hectares that were extinguished during the weekend of June 8-9. About 300 thousand people die every year because of smoke from fires. The combustion of biomass produces an aerosol-gas mixture, which poses an ecological and toxicological risk for humans.


Fire-fighting service personnel should be provided with the most effective fire-fighting equipment and with equipment for eliminating the consequences of natural phenomena. However, this is often not enough to fight this dangerous phenomenon effectively. Strategic planning and resource allocation, such as providing a sufficient number of fire-fighting aircraft or ground crews, can significantly improve the chances of controlling a fire. However, calculating the required amount of resources can take a lot of time.

One way to solve this problem can be the use of neural networks. In the present work, the authors 
used the data on fires in the Montesinho Park in Portugal to train and test the neural network. This set of 
data is available for research and work [1]. The authors use the Keras neural network library [2], written 
in the Python programming language [3, 4]. 

Data preparation. The Montesinho Park fire data were chosen as training material for the neural network model because the complex fire hazard indicator of V. G. Nesterov used in the Russian Federation contains fewer parameters, which may lead to poorer training results. In addition to the traditional parameters, the set used by the authors contains the following: moisture content of forest litter and soil, flame characteristics, anthropogenic factor and thunderstorm activity. The following additional parameters of the forest fire danger rating system were used in forming this set [5]:

• probability of fire (Fine Fuel Moisture Code, FFMC);
• duff moisture rate (Duff Moisture Code, DMC);
• drought rate (Drought Code, DC);
• index of the initial spread of fire (Initial Spread Index, ISI).

All meteorological data needed to calculate the above components can be requested from the nearest meteorological service. Since this data set contains quite a lot of climatic parameters, the created and trained model should make it possible to predict the future area of a fire not only for Montesinho Park, but also for any other similar territory.

Complete data in the set: 

• X: X-axis spatial coordinate on the Montesinho Park map, from 1 to 9;
• Y: Y-axis spatial coordinate on the Montesinho Park map, from 2 to 9;
• "month": month of the year, from January to December;
• "day": day of the week, from Monday to Sunday;
• FFMC: index of the ease of ignition of the fuel from the FWI system, from 18.7 to 96.2;
• DMC: index of the duff moisture content from the FWI system, from 1.1 to 291.3;
• DC: drought index from the FWI system, from 7.9 to 860.6;
• ISI: initial spread index from the FWI system, from 0 to 56.1;
• "Temperature": temperature, from 2.2 to 33.3 °C;
• relative humidity: from 15.0 to 100 %;
• "Wind": wind speed, from 0.4 to 9.4 km/h;
• outside rain: from 0.0 to 6.4 mm/m²;
• "Area": burned forest area, from 0.00 to 1090.84 ha.
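The "month" and "day" columns are textual, so they have to be converted to numbers before the table can be passed to a neural network. The article does not say how this was done. The sketch below assumes a simple ordinal encoding, three-letter lowercase abbreviations, and the column names shown in the header line; the two records are made-up values inside the ranges listed above, not rows from the real data set.

```python
import csv
import io

MONTHS = ["jan", "feb", "mar", "apr", "may", "jun",
          "jul", "aug", "sep", "oct", "nov", "dec"]
DAYS = ["mon", "tue", "wed", "thu", "fri", "sat", "sun"]

# Made-up sample rows (NOT real records); the header names are an assumption.
SAMPLE_CSV = """X,Y,month,day,FFMC,DMC,DC,ISI,temp,RH,wind,rain,area
4,5,aug,sun,91.0,120.5,650.2,8.4,24.1,35,3.6,0.0,2.50
7,4,mar,fri,85.3,30.1,95.0,4.9,9.8,52,5.4,0.2,0.00
"""

NUMERIC_COLUMNS = ("FFMC", "DMC", "DC", "ISI", "temp", "RH", "wind", "rain")


def encode_row(row):
    """Turn one CSV record into (features, target) with numeric values only."""
    features = [
        float(row["X"]),
        float(row["Y"]),
        float(MONTHS.index(row["month"]) + 1),  # jan -> 1 ... dec -> 12
        float(DAYS.index(row["day"]) + 1),      # mon -> 1 ... sun -> 7
    ]
    features += [float(row[name]) for name in NUMERIC_COLUMNS]
    return features, float(row["area"])


records = [encode_row(r) for r in csv.DictReader(io.StringIO(SAMPLE_CSV))]
for features, area in records:
    print(len(features), features[:4], area)
```

Each record becomes 12 input values plus the burned area as the target. Other encodings, such as one-hot vectors for month and day, would change the number of inputs.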

All the parameters in the set vary over different ranges. To improve the prediction accuracy of the model, the data should be normalized. One way to normalize the data is to subtract the mean from each parameter and divide the result by the standard deviation. After this, each column has a mean of zero and a variance of one. The values are then centered on zero rather than confined to a fixed interval, so the columns contain negative values, which is not meaningful for some parameters. To avoid this, the MinMaxScaler().fit_transform() procedure [6] can be used, which converts all data to the range 0 ... +1.

The model is trained by supervised learning. In this case, the data are divided into two parts: the data for training and the correct answers for these data. The training data are used to train the model, and the answers are used to recalculate the weights on the edges of the neural network graph when the predicted value and the actual value do not match. Before training, the data are randomly divided into a training sample and a test sample. The training sample is the part of the dataset used to train the model; it makes up about 90 % of the set. The test sample is the remaining 10 % of the dataset and is used to evaluate the effectiveness of the model. The test data do not take part in training and are used only to verify the model. In the future, such modeling can be combined with the classical construction of models and the calculation of technosphere safety indicators [7, 8].
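The two normalization options and the random 90/10 split can be sketched in plain Python. This is an illustration of the arithmetic only, not the authors' code: the function names, the toy temperature column and the row count of 500 are all hypothetical, and min_max_scale reproduces by hand what MinMaxScaler computes for a single column with its default range.

```python
import random
import statistics


def standardize(column):
    """Z-score: subtract the mean, divide by the standard deviation.

    Assumes the column is not constant (otherwise the deviation is zero).
    """
    mean = statistics.fmean(column)
    std = statistics.pstdev(column)
    return [(v - mean) / std for v in column]


def min_max_scale(column):
    """Rescale a non-constant column linearly so that min -> 0 and max -> 1."""
    lo, hi = min(column), max(column)
    return [(v - lo) / (hi - lo) for v in column]


def split_indices(n_rows, test_share=0.1, seed=0):
    """Shuffle the row indices and cut them into training and test parts."""
    indices = list(range(n_rows))
    random.Random(seed).shuffle(indices)
    n_test = round(n_rows * test_share)
    return indices[n_test:], indices[:n_test]


# Hypothetical temperature readings in degrees Celsius.
temperature = [2.2, 10.0, 18.5, 25.0, 33.3]
z = standardize(temperature)      # mean 0, variance 1, contains negatives
m = min_max_scale(temperature)    # every value lies in [0, 1]

# Hypothetical data set of 500 rows: 450 for training, 50 for testing.
train_idx, test_idx = split_indices(500)
print(len(train_idx), len(test_idx))
```

On this toy column the standardized values run from about -1.43 to 1.42, while the min-max values stay between 0 and 1.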

Creation of a model. Among the different models tested on the dataset under consideration, the model with 6 layers performed best: the input layer with 24 neurons, 4 hidden layers containing 48, 96, 48 and 24 neurons, and the output neuron.

The following activation functions were used: 

• linear: on the first, second and fifth layers;
• sigmoid: on the second and third layers, it allows you to amplify weak signals without being saturated with strong ones;
• selu: on the output neuron, increases the convergence rate of the neural network.

When compiling the model, "adadelta" was used as the gradient descent optimizer. Like Adagrad, Adadelta makes smaller updates to weights that are updated too frequently, but instead of the full accumulated sum of squared gradients it uses a running average over the history of squared gradients.
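The running-average idea can be written out explicitly. The notation below follows the usual formulation of Adadelta and is not taken from the article: g_t is the gradient at step t, θ a weight, ρ the decay rate of the averages and ε a small constant for numerical stability.

```latex
\begin{aligned}
E[g^2]_t &= \rho\, E[g^2]_{t-1} + (1-\rho)\, g_t^2, \\
\Delta\theta_t &= -\frac{\sqrt{E[\Delta\theta^2]_{t-1} + \varepsilon}}
                        {\sqrt{E[g^2]_t + \varepsilon}}\; g_t, \\
E[\Delta\theta^2]_t &= \rho\, E[\Delta\theta^2]_{t-1} + (1-\rho)\, \Delta\theta_t^2, \\
\theta_{t+1} &= \theta_t + \Delta\theta_t .
\end{aligned}
```

Adagrad divides by the square root of the sum of all past squared gradients, so its steps can only shrink. The exponentially decaying average in the first line forgets old gradients and lets the step size recover.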

As the error function used by the optimizer in the error backpropagation algorithm, we choose the mean squared error, and as the metric, "mae", the mean absolute error.
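A minimal Keras sketch of such a network follows. It is an illustration under stated assumptions, not the authors' code. The input is assumed to have 12 features (every column except the burned area, with month and day each encoded as one number). The activation list names the second layer twice and does not mention the fourth, so the sketch assumes sigmoid on the third and fourth layers. The loss is assumed to be mean squared error. The build_model name is hypothetical.

```python
import keras
from keras import layers

N_FEATURES = 12  # assumption: all columns except "Area", one number each


def build_model(n_features=N_FEATURES):
    """Six dense layers: 24-48-96-48-24 neurons and one output neuron."""
    model = keras.Sequential([
        keras.Input(shape=(n_features,)),
        layers.Dense(24, activation="linear"),   # first layer
        layers.Dense(48, activation="linear"),   # second layer
        layers.Dense(96, activation="sigmoid"),  # third layer
        layers.Dense(48, activation="sigmoid"),  # fourth layer (assumed)
        layers.Dense(24, activation="linear"),   # fifth layer
        layers.Dense(1, activation="selu"),      # output neuron
    ])
    # Assumed loss: mean squared error; metric: mean absolute error.
    model.compile(optimizer="adadelta", loss="mse", metrics=["mae"])
    return model


model = build_model()
model.summary()
```

With 12 inputs this stack has 12073 trainable parameters, which is large compared with a data set of a few hundred records, so the test sample matters for detecting overfitting.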

Training. At the 500th training epoch, the mean absolute error is 4.6, i.e. the predictions of the model are off by 4.6 hectares on average, which the authors consider satisfactory. Fig. 1 shows the curve of the error change.
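The metric quoted here is the mean of the absolute differences between the actual and the predicted areas. A minimal sketch of the calculation, with made-up areas in hectares and a hypothetical function name:

```python
def mean_absolute_error(actual, predicted):
    """Average of |actual - predicted| over all records."""
    if len(actual) != len(predicted):
        raise ValueError("inputs must have the same length")
    if not actual:
        raise ValueError("inputs must not be empty")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Made-up burned areas in hectares, for illustration only.
actual_ha = [0.0, 10.0, 2.0]
predicted_ha = [1.0, 6.0, 2.0]
mae = mean_absolute_error(actual_ha, predicted_ha)
print(mae)
```

Because the metric is an average over all records, the error on an individual fire can be noticeably larger or smaller than the reported value.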


Fig. 1. Curve of the absolute error change during training on normalized data (vertical axis: error in hectares)
Fig. 2 shows the graph of the absolute error change for unnormalized data; it indicates that training on normalized data gives better results than training on the initial data.


Fig. 2. Curve of the absolute error change during training on unnormalized data (vertical axis: error in hectares)


Forecasting. Fig. 3 illustrates the results produced by the neural network. On the graph, the orange line is the actual burning area and the blue line is the predicted area. As can be seen in the figure, the dynamics of the two curves are almost identical for each record, which shows the good performance of the trained model in forecasting.


Fig. 3. Forecasting using the model trained on normalized data


Fig. 4 shows the forecasting graph of another model, which was trained on unnormalized data. Obviously, the first model is more effective.

Fig. 4. Forecasting using the model trained on unnormalized data


Conclusion. In this paper, an artificial neural network model was created and trained on a data set containing various climatic parameters and the future fire area in hectares. This area is the output parameter that the authors set out to forecast. As a rule, such a dataset is available for research and study. Before training the neural network model, the dataset was divided into two samples: a training sample, which is about 90 % of the set, and a sample for testing the trained model. In formulating the problem, the authors selected and analyzed the known data on the fires that occurred in Montesinho Park and compared the models trained on these data with and without normalization. As a result, two examples of graphs of the absolute error of the fire areas predicted by the created and trained model are given.